Estimating predicted probabilities from logistic regression: different methods correspond to different target populations.

Authors

  • Clemma J Muller
  • Richard F MacLehose
Abstract

BACKGROUND We review three common methods to estimate predicted probabilities following confounder-adjusted logistic regression: marginal standardization (predicted probabilities summed to a weighted average reflecting the confounder distribution in the target population); prediction at the modes (conditional predicted probabilities calculated by setting each confounder to its modal value); and prediction at the means (predicted probabilities calculated by setting each confounder to its mean value). That each method corresponds to a different target population is underappreciated in practice. Specifically, prediction at the means is often incorrectly interpreted as estimating average probabilities for the overall study population, and furthermore yields nonsensical estimates in the presence of dichotomous confounders. Default commands in popular statistical software packages often lead to inadvertent misapplication of prediction at the means.

METHODS Using an applied example, we demonstrate discrepancies in predicted probabilities across these methods, discuss implications for interpretation, and provide syntax for SAS and Stata.

RESULTS Marginal standardization allows inference to the total population from which data are drawn. Prediction at the modes or means allows inference only to the relevant stratum of observations. With dichotomous confounders, prediction at the means corresponds to a stratum that does not include any real-life observations.

CONCLUSIONS Marginal standardization is the appropriate method when making inference to the overall population. Other methods should be used with caution, and prediction at the means should not be used with binary confounders. Stata, but not SAS, incorporates simple methods for marginal standardization.
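The three methods described in the abstract can be sketched in a few lines. The example below is only an illustration in Python, not the SAS or Stata syntax the paper provides: the coefficients in `BETA`, the six-subject sample in `ROWS`, and the confounder names `smoker` and `age` are all hypothetical and are not taken from the paper. It assumes a logistic model that has already been fitted and only contrasts how each method turns the same coefficients into a predicted probability.

```python
import math
from statistics import mean, mode

# Hypothetical log-odds coefficients (illustrative only, not from the paper).
BETA = {"intercept": -4.0, "exposure": 1.0, "smoker": 2.5, "age": 0.03}

# Hypothetical confounder values for six subjects: (smoker 0/1, age in years).
ROWS = [(0, 30), (0, 40), (0, 40), (0, 50), (1, 40), (1, 60)]


def predict(exposure, smoker, age):
    """Predicted probability for one covariate pattern under the assumed model."""
    z = (BETA["intercept"] + BETA["exposure"] * exposure
         + BETA["smoker"] * smoker + BETA["age"] * age)
    return 1.0 / (1.0 + math.exp(-z))


def marginal_standardization(exposure, rows):
    """Set exposure for everyone, predict per subject, then average.

    The average is taken over the observed confounder distribution, so the
    target population is the whole sample.
    """
    return mean([predict(exposure, s, a) for s, a in rows])


def prediction_at_means(exposure, rows):
    """Predict once, with each confounder set to its sample mean.

    For the binary confounder this is a smoker value strictly between 0 and 1,
    which no subject in the sample has.
    """
    smoker_bar = mean([s for s, _ in rows])
    age_bar = mean([a for _, a in rows])
    return predict(exposure, smoker_bar, age_bar)


def prediction_at_modes(exposure, rows):
    """Predict once, with each confounder set to its most common value."""
    smoker_mode = mode([s for s, _ in rows])
    age_mode = mode([a for _, a in rows])
    return predict(exposure, smoker_mode, age_mode)


if __name__ == "__main__":
    for name, method in [("marginal standardization", marginal_standardization),
                         ("prediction at the means", prediction_at_means),
                         ("prediction at the modes", prediction_at_modes)]:
        print(f"{name}: P(outcome | exposed) = {method(1, ROWS):.3f}")
```

With these made-up inputs the three functions return noticeably different probabilities for the exposed group. Because the logistic function is nonlinear, the probability at the mean covariate values is not the mean of the individual probabilities, which is one way to see why each method answers a question about a different target population.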

Similar resources

Predicting amphipod toxicity from sediment chemistry using logistic regression models.

Individual chemical logistic regression models were developed for 37 chemicals of potential concern in contaminated sediments to predict the probability of toxicity, based on the standard 10-d survival test for the marine amphipods Ampelisca abdita and Rhepoxynius abronius. These models were derived from a large database of matching sediment chemistry and toxicity data, which includes contamina...

Tools and Technology Note A Comparison of Methods for Estimating Northern Bobwhite Covey Detection Probabilities

We compared the time-of-detection and logistic regression methods of estimating probability of detection for northern bobwhite (Colinus virginianus) coveys. Both methods are unusual in that they allow estimation of the total probability of detection (i.e., the product of the probability that a covey is available for detection [i.e., that a covey vocalizes] and detection given availability). The...

Examining psychological factors associated with transitions through the stages of cigarette smoking in adolescents

  Introduction: Previous studies have suggested that early smoking initiation predicts longer duration of smoking, heavier daily consumption and increased chances of nicotine dependence. The goal of the present study was to examine the effect of psychological factors on three transitions in the adolescent smoking uptake process: from never smoking to experimentation and regular smoking, and fr...

Prediction of Non-exercise Iranian Cardiorespiratory Fitness and Investigation the Effective Components on Society: an analytic (case-control) study

Background & Aims: In large populations, the level of cardiorespiratory fitness (CRF) can be evaluated by estimating non-exercise models with the least amount of time and money. In this study, we present non-exercise equations for predicting CRF and also investigate the effect of socio-environmental factors on it. Materials & Methods: 2490 male and female subjects aged 25-65 years old from dif...

Estimating the accuracy of genomic selection in small genetic populations: a simulation study

In the present study, two genetically connected populations, one small and one large, were simulated, and the effect of different sources of information from foreign populations on the accuracy of predicted genomic breeding values of young animals of the small population was investigated. The large population consisted of 200000 animals over 15 generations and the small population consisted of 5000 animals over 3 ...

Journal title:
  • International journal of epidemiology

Volume 43, Issue 3

Pages: -

Publication date: 2014